A Level-Synchronous Approach to Ill-formed Sentence Parsing and Error Recovery

نویسندگان

  • Yi-Chung Lin
  • Keh-Yih Su
چکیده

In this paper, a level-synchronous parsing mechanism, named Phrase-Level Building (PLB), is proposed to incorporate wide-scope contextual information for parsing illformed sentences. This mechanism regards the task of parsing a sentence as the task of building the phrase-levels for the sentence. Therefore, the wide-scope contextual information in the phrase-levels can be used to help narrow down the search space and to select the most likely partial parses. Compared with the system which uses both the stochastic context-free grammar and the heuristics of preferring the longest phrase, the proposed PLB approach improves the precision rate for brackets in the partial parse forests from 69.37% to 79.49%. The recall rate for brackets is also improved from 78.73% to 81.39%. The proposed PLB parsing method can also be used to recover errors in illformed sentences, so that more complete syntactic information can be provided in later stages. Experimental results show that 35% of the ill-formed sentences can be recovered to well-formed parses. The recall rate for brackets is also significantly improved from 68.49% to 76.60% while the precision rate for brackets is improved slightly from 79.49% to 80.69%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Syntactic Recovery and Spelling Correction of Ill-formed Sentences

This paper describes syntactic repair and spelling correction of ill-formed sentences within a context-free grammar using non-static filtering, of ill-formed sentences which violate subjectverb agreement or premodifier-noun agreement. The system described here provides recovery of local trees, reconstruction of the sentence, and spelling correction of detected typographical errors. It also prod...

متن کامل

Integrated Correction of Ill-Formed Sentences

This paper describes a system that performs hierarchical error recovery, and detects and corrects a single error in a sentence at the lexical, syntactic, and/or semantic levels. If the system is unable to repair an erroneous sentence on the assumption that it has a single error, a multiple error recovery system is invoked. The system employs a chart parsing algorithm and uses an augmented conte...

متن کامل

بررسی مقایسه‌ای تأثیر برچسب‌زنی مقولات دستوری بر تجزیه در پردازش خودکار زبان فارسی

In this paper, the role of Part-of-Speech (POS) tagging for parsing in automatic processing of the Persian language is studied. To this end, the impact of the quality of POS tagging as well as the impact of the quantity of information available in the POS tags on parsing are studied. To reach the goals, three parsing scenarios are proposed and compared. In the first scenario, the parser assigns...

متن کامل

An improved joint model: POS tagging and dependency parsing

Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...

متن کامل

Error recovery for robust language understanding in spoken dialogue systems

In this paper, we proposed an example-based approach aiming at recovering ill-formed inputs to improve robustness of spoken dialogue systems. In this approach, a treebank, which contains example sentences and their correct parse trees, is used to provide clues for fixing the errors of ill-formed inputs. Particularly, the proposed error recovery method is suitable for spoken dialogue application...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJCLCLP

دوره 4  شماره 

صفحات  -

تاریخ انتشار 1999